RAW SEQUENCE LISTING 



The Biotechnology Systems Branch of the Scientific and Technical 
Information Center (STIC) detected no errors. 

Application Serial Number: 10/575,560A 

Source: /^/^ 

Date Processed by STIC: 11/3/06 



ENTERED 



RAW SEQUENCE LISTING DATE: 11/03/2006 

PATENT APPLICATION: US/10/575,560A TIME: 10:30:26 

Input Set : A:\33412_SEQ ID. ST25.txt 

Output Set: N:\CRF4\11032006\J575560A.raw 

3 <110> APPLICANT: Sandoz GmbH 

5 <120> TITLE OF INVENTION: Process for Producing Penicillin 

7 <130> FILE REFERENCE: 33412US-PCT 

9 <140> CURRENT APPLICATION NUMBER: US/10/575,560A 

10 <141> CURRENT FILING DATE: 2006-04-11 

12 <150> PRIOR APPLICATION NUMBER: PCT/EP2004/011566 

13 <151> PRIOR FILING DATE: 2004-10-14 
15 <160> NUMBER OF SEQ ID NOS : 8 

17 <170> SOFTWARE: PatentIn version 3.1 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 411 

21 <212> TYPE: PRT 

22 <213> ORGANISM: Penicillium chrysogenum 
24 <400> SEQUENCE: 1 



26 Met Val Asp Pro Ser Val Ser Gly Ile Thr Lys Met Asp Thr Asn Asp 

27 1 5 10 15 

30 Ile Lys Gln Asn Asp Ile Pro Lys Asp Gln Pro Thr Leu Val Arg Trp 

31 20 25 30 

34 Tyr Met Asp Val Arg Arg Trp Asp Glu Lys Tyr Phe Asp Leu Pro Leu 

35 35 40 45 

38 Leu Glu Thr Leu Thr Gln Pro Asp Gln Ala Ala Val Lys Lys Tyr Tyr 

39 50 55 60 

42 Gln Thr Ser Asp Lys Arg Leu Ser Leu Ala Ser Gln Leu Leu Lys Tyr 

43 65 70 75 80 

46 Tyr Tyr Ile His Gln Ala Thr Gly Thr Pro Trp Ser Lys Ile Glu Ile 

47 85 90 95 

50 Gln Arg Thr Pro Met Pro Glu Asn Arg Pro Phe Tyr Asp Ser Ser Leu 

51 100 105 110 

54 Asp Phe Asn Val Ser His Gln Ala Gly Leu Thr Leu Phe Ala Gly Thr 

55 115 120 125 

58 Arg Ala Ala Thr Ala His Ser Leu Ser Gly Gly Pro Gln Thr Leu Pro 

59 130 135 140 

62 Arg Val Gly Ile Asp Val Ala Cys Val Asp Glu Pro Ser Arg Arg Arg 

63 145 150 155 160 

66 Ala Asn Arg Pro Pro Lys Thr Leu Ala Asp Leu Ala Thr Phe Val Asp 

67 165 170 175 

70 Val Phe Ser Asp Val Leu Ser Leu Arg Glu Leu Ala Thr Ile Lys Asn 

71 180 185 190 

74 Pro Tyr Ala Thr Leu Lys Leu Ala Arg Glu Leu Gly Leu Asn Lys Ser 

75 195 200 205 

78 Asp Pro Ser Lys Asp Asp Gln Glu Val Leu Ala Ala Tyr Gly Ile Arg 

79 210 215 220 

82 Leu Phe Tyr Ser Ile Trp Ala Leu Lys Glu Ala Tyr Leu Lys Met Thr 

83 225 230 235 240 

86 Gly Asp Gly Leu Leu Ala Ser Trp Ile Lys Asp Leu Glu Phe Thr Asn 

87 245 250 255 

90 Val Val Pro Pro Glu Pro Val Gln Thr Val Gly Phe Ala Gly Asp Pro 

91 260 265 270 

94 Ser Ala Thr His Ala Pro Ser Val Gln Asn Trp Gly Arg Pro Tyr Ser 

95 275 280 285 

98 Asp Val Lys Ile Ser Leu Arg Gly Ile Pro Asp His Ser Val Arg Val 

99 290 295 300 



102 Gln Pro Val Gly Phe Glu Ser Asp Tyr Ile Val Ala Thr Ala Ala Ser 

103 305 310 315 320 

106 Gly Pro Asn Ile Gly Ser Val Ser Arg Gln Val Val Val Asn Asp Ser 

107 325 330 335 

110 Asp His His Leu Pro Gly Arg Ile Thr Ala Phe Asp Ser Glu Thr Gly 

111 340 345 350 

114 Leu Gln Asn Val Arg Ile Pro Pro Ile Ala Leu Arg Ser Ile Gly Asp 

115 355 360 365 

118 Gly Asp Pro Trp Arg Val Asp Ser Lys Ile Ser Asp Pro Trp Leu Pro 

119 370 375 380 

122 Met Gln Glu Val Asp Ile Glu Ile Asp Ile Arg Pro Cys Ala Asp Gly 

123 385 390 395 400 

126 Arg Cys Glu His Leu Arg Asp Leu Pro Ser Phe 

127 405 410 

130 <210> SEQ ID NO: 2 

131 <211> LENGTH: 1284 

132 <212> TYPE: DNA 

133 <213> ORGANISM: Penicillium chrysogenum 
135 <400> SEQUENCE: 2 
136 atggtagacc ccagtgtgtc tggaattgtg agtagccaca tagcctccat gagtgcaccc 60 
138 actgaccaat ttcagaccaa aatggatacc aatgatatca aacagaatga catccccaag 120 
140 gaccagccca cgttggtccg atggtacatg gatgtcagac gttgggatga aaaatacttt 180 
142 gatctccctt tgcttgaaac cttaacacag cctgatcagg cagctgtcaa gaagtactat 240 
144 caaacatcgg acaagcgcct gtccttggcc tcccagttgc tgaaatatta ctacattcac 300 
146 caagccactg gcactccctg gagcaagatt gagatccagc gtactccgat gcccgaaaat 360 
148 cgaccattct acgattcaag cctggatttc aacgtcagcc atcaggctgg tctcactctg 420 
150 ttcgcaggca cgcgtgccgc aacagcccac tccttatccg gtggacctca aacattgcct 480 
152 cgcgtgggaa ttgacgttgc gtgtgttgat gaaccctctc gtcgtcgtgc taatcgtccc 540 
154 ccgaagacac ttgccgacct tgcaaccttc gtggatgtct tcagtgacgt tctctcactc 600 
156 cgtgagcttg cgaccatcaa gaacccgtac gcgactctta aattggctcg tgagcttggt 660 
158 ctgaataaaa gtgacccgag caaagacgac caggaagtcc ttgctgccta cggcattcgg 720 
160 ctgttctact cgatttgggc tctcaaggag gcttacttga aaatgaccgg agacggcctt 780 
162 ctggcctctt ggataaagga tctggaattc acaaacgttg ttccccccga accagttcaa 840 
164 acagtcggat ttgctggtga tccttctgcc actcacgcgc cctcggtcca aaattggggc 900 
166 cggccttact ccgatgtcaa aatctccttg cgtggcattc ctgaccattc tgtgcgcgtt 960 
168 cagctcgtcg gcttcgagtc cgactacata gttgccacgg ccgcgtcggg ccccaatatt 1020 
170 ggatccgttt cgcggcaggt agtcgtgaat gacagcgatc accatctgcc agggcgtatc 1080 
172 acagccttcg actctgagac tggactccag aacgtccgca ttcccccaat cgcgcttcga 1140 
174 tcaattggcg atggggaccc ctggcgtgtg gactcgaaaa tcagcgaccc ctggctcccc 1200 
176 atgcaggagg tcgatattga aatcgatatc cggccctgtg cggatggtcg ttgcgagcac 1260 




178 ctacgggatt taccaagctt ttaa 1284 

181 <210> SEQ ID NO: 3 

182 <211> LENGTH: 1236 

183 <212> TYPE: DNA 

184 <213> ORGANISM: Penicillium chrysogenum 

186 <400> SEQUENCE: 3 

187 atggtagacc ccagtgtgtc tggaattacc aaaatggata ccaatgatat caaacagaat 60 
189 gacatcccca aggaccagcc cacgttggtc cgatggtaca tggatgtcag acgttgggat 120 
191 gaaaaatact ttgatctccc tttgcttgaa accttaacac agcctgatca ggcagctgtc 180 
193 aagaagtact atcaaacatc ggacaagcgc ctgtccttgg cctcccagtt gctgaaatat 240 
195 tactacattc accaagccac tggcactccc tggagcaaga ttgagatcca gcgtactccg 300 
197 atgcccgaaa atcgaccatt ctacgattca agcctggatt tcaacgtcag ccatcaggct 360 
199 ggtctcactc tgttcgcagg cacgcgtgcc gcaacagccc actccttatc cggtggacct 420 
201 caaacattgc ctcgcgtggg aattgacgtt gcgtgtgttg atgaaccctc tcgtcgtcgt 480 
203 gctaatcgtc ccccgaagac acttgccgac cttgcaacct tcgtggatgt cttcagtgac 540 
205 gttctctcac tccgtgagct tgcgaccatc aagaacccgt acgcgactct taaattggct 600 
207 cgtgagcttg gtctgaataa aagtgacccg agcaaagacg accaggaagt ccttgctgcc 660 
209 tacggcattc ggctgttcta ctcgatttgg gctctcaagg aggcttactt gaaaatgacc 720 
211 ggagacggcc ttctggcctc ttggataaag gatctggaat tcacaaacgt tgttcccccc 780 
213 gaaccagttc aaacagtcgg atttgctggt gatccttctg ccactcacgc gccctcggtc 840 
215 caaaattggg gccggcctta ctccgatgtc aaaatctcct tgcgtggcat tcctgaccat 900 
217 tctgtgcgcg ttcagcccgt cggcttcgag tccgactaca tagttgccac ggccgcgtcg 960 
219 ggccccaata ttggatccgt ttcgcggcag gtagtcgtga atgacagcga tcaccatctg 1020 
221 ccagggcgta tcacagcctt cgactctgag actggactcc agaacgtccg cattccccca 1080 
223 atcgcgcttc gatcaattgg cgatggggac ccctggcgtg tggactcgaa aatcagcgac 1140 
225 ccctggctcc ccatgcagga ggtcgatatt gaaatcgata tccggccctg tgcggatggt 1200 
227 cgttgcgagc acctacggga tttaccaagc ttttaa 1236 

230 <210> SEQ ID NO: 4 

231 <211> LENGTH: 3188 

232 <212> TYPE: DNA 

233 <213> ORGANISM: Penicillium chrysogenum 

235 <400> SEQUENCE: 4 

236 gtcgaccgaa gtggtttcgg ttcactcgca catcaagacc accgatcagc tcttgcccgc 60 
238 ccttctttgt cttgttggca gactcggcaa gcaaaatgag cccggcgcat gtaccccacg 120 
240 tcggtttgcg atccactctg cataacccac gtattagatc gaattgatat ggactaaccc 180 
242 ggttcactca ctttacgaat tctcgcagtg gctcgagaag atttgacctt gctgcgacta 240 
244 aagacatagt ggtactctcg cctccgggca agaccaggcc gtcgcatgtt gccagttctt 300 
246 gtggcgtccg tacttcaatg aagtgccatt ccgacggctg cgcttgctca gcggcctttt 360 
248 tcaaaagctg cacatgctca aagaatgcgc cctgtagggc caggactcca acagtgatag 420 
250 ccatttcctc tgaagatcgg aattgcggac cctccgagct cgggtgcttc ttgatattga 480 
252 tgactctttt taaagcacat gactttgact ttccggcggg gaacgtatca acacgtgatg 540 
254 gcggcttatc tccatcttta attccacgcg acatcaggat atcgtgagag ctctcggacg 600 
256 attcctgcgc actttgaaaa cagactgcat aaccgaggca ttatagtata aaacaaatag 660 
258 actcacctac agaaagagtg ataagttagg tcctatacct gtttccaatg tttctctctc 720 
260 ttgctggatc agctttaaca tatctatgga tggtatcttg gatagtcata gtcatattgc 780 
262 gcttgctatt gcatgtctct ttgctacatc ctatttatgg tattatgtac acggcctgtt 840 
264 tctcgtttgc cggcctattg atgtatacat gtattggtgt aggtagttat tgcctcgcct 900 
266 tatcgacacg tgctgataga taaggacccc gataagacgc caacatggct tctatccagg 960 
268 tgtggatgct ccgcatccaa ggtgcgaata tacgagatca caatgcaatg gtagacccca 1020 



270 gtgtgtctgg aattgtgagt agccacatag cctccatgag tgcacccact gaccaatttc 1080 

272 agaccaaaat ggataccaat gatatcaaac agaatgacat ccccaaggac cagcccacgt 1140 

274 tggtccgatg gtacatggat gtcagacgtt gggatgaaaa atactttgat ctccctttgc 1200 

276 ttgaaacctt aacacagcct gatcaggcag ctgtcaagaa gtactatcaa acatcggaca 1260 

278 agcgcctgtc cttggcctcc cagttgctga aatattacta cattcaccaa gccactggca 1320 

280 ctccctggag caagattgag atccagcgta ctccgatgcc cgaaaatcga ccattctacg 1380 

282 attcaagcct ggatttcaac gtcagccatc aggctggtct cactctgttc gcaggcacgc 1440 

284 gtgccgcaac agcccactcc ttatccggtg gacctcaaac attgcctcgc gtgggaattg 1500 

286 acgttgcgtg tgttgatgaa ccctctcgtc gtcgtgctaa tcgtcccccg aagacacttg 1560 

288 ccgaccttgc aaccttcgtg gatgtcttca gtgacgttct ctcactccgt gagcttgcga 1620 

290 ccatcaagaa cccgtacgcg actcttaaat tggctcgtga gcttggtctg aataaaagtg 1680 

292 acccgagcaa agacgaccag gaagtccttg ctgcctacgg cattcggctg ttctactcga 1740 

294 tttgggctct caaggaggct tacttgaaaa tgaccggaga cggccttctg gcctcttgga 1800 

296 taaaggatct ggaattcaca aacgttgttc cccccgaacc agttcaaaca gtcggatttg 1860 

298 ctggtgatcc ttctgccact cacgcgccct cggtccaaaa ttggggccgg ccttactccg 1920 

300 atgtcaaaat ctccttgcgt ggcattcctg accattctgt gcgcgttcag ctcgtcggct 1980 

302 tcgagtccga ctacatagtt gccacggccg cgtcgggccc caatattgga tccgtttcgc 2040 

304 ggcaggtagt cgtgaatgac agcgatcacc atctgccagg gcgtatcaca gccttcgact 2100 

306 ctgagactgg actccagaac gtccgcattc ccccaatcgc gcttcgatca attggcgatg 2160 

308 gggacccctg gcgtgtggac tcgaaaatca gcgacccctg gctccccatg caggaggtcg 2220 

310 atattgaaat cgatatccgg ccctgtgcgg atggtcgttg cgagcaccta cgggatttac 2280 

312 caagctttta aattccttct tgctgggata tgaccaggcg accatgcacc cgagttattt 2340 

314 gcatattgca tctcctcatc tcatattcct ttctgagcgt gtttttcgga gcgataatta 2400 

316 cccttgaaca tatttctgca ttgctgtatt gccattagcg aaaattcccg agctagttgt 2460 

318 agttgatttc ctggaacgct gggggagtgc cgctcagatg ttcatctcca ataagcccct 2520 

320 caatgaatct tcacttcatc ggatccaagg tcaatcttcg agatcaagtg caagttgccc 2580 

322 agaaagcacg ggtaaagaaa ccaagcctat ttctattcta tggtctaatg taaactaaaa 2640 

324 atgtagaagg aagaaaagca agtatccaac agtaggcggg tcatgacatg cgtgtgcgct 2700 

326 aaggatatat acatttcgaa ttgcaaagag ggaagaggtg aatcaggagt gaaatgtgtg 2760 

328 tcaagaggca atgtcaatgt caagatcatt gttgctctca tgagcagtca cggattgtgt 2820 

330 cggattgttc ggcgtctggg gccctcagat tctatttctg ggtcatgagc ttgagagtag 2880 

332 gtaccgaaga agtgagcagt attatactgc agtgagtgtt tagggggaat tccttctggt 2940 

334 gaattgtggc gttcggggtt gctctccggt cttatgggtc ttaatctgga tgcccgatag 3000 

336 tgcacccaag ttaggagaaa aacatatggt aagtgttaat cgtggagcag tgtggcgaat 3060 

338 cgcgaattgg gtttggcact tagatttcga tggcgctaga gacgccgttg gcgcgagcac 3120 

340 catcgacctc atttttatgc gcgtgggaca ttgctgcaag agttttgagc atcgaatccc 3180 

342 gcgtcgac 3188 

345 <210> SEQ ID NO: 5 

346 <211> LENGTH: 37 

347 <212> TYPE: DNA 

348 <213> ORGANISM: Artificial Sequence 

350 <220> FEATURE: 

351 <223> OTHER INFORMATION: oligonucleotide primer 

353 <400> SEQUENCE: 5 

354 ccccgtcgac cgaagtggtt tcggttcact cgcacat 37 

357 <210> SEQ ID NO: 6 

358 <211> LENGTH: 37 

359 <212> TYPE: DNA 

360 <213> ORGANISM: Artificial Sequence 

362 <220> FEATURE: 

363 <223> OTHER INFORMATION: oligonucleotide primer 

365 <400> SEQUENCE: 6 

366 ccccgtcgac gcgggattcg atgctcaaaa ctcttgc 37 

369 <210> SEQ ID NO: 7 

370 <211> LENGTH: 41 

371 <212> TYPE: DNA 

372 <213> ORGANISM: Artificial Sequence 

374 <220> FEATURE: 

375 <223> OTHER INFORMATION: oligonucleotide primer 

377 <400> SEQUENCE: 7 

378 gggggccgag gcggcccatg gataccaatg atatcaaaca g 41 

381 <210> SEQ ID NO: 8 

382 <211> LENGTH: 43 

383 <212> TYPE: DNA 

384 <213> ORGANISM: Artificial Sequence 

386 <220> FEATURE: 

387 <223> OTHER INFORMATION: oligonucleotide primer 

389 <400> SEQUENCE: 8 

390 gggggccatt atggcctcat tcaggactac ctgccgcgaa acg 43 
VERIFICATION SUMMARY DATE: 11/03/2006 

PATENT APPLICATION: US/10/575,560A TIME: 10:30:27 

Input Set : A:\33412_SEQ ID. ST25.txt 
Output Set: N:\CRF4\11032006\J575560A.raw 